To help developers protect their applications against possible
misuse, we are introducing the faster and more accurate Moderation
endpoint. This endpoint provides OpenAI API developers with free
access to GPT-based classifiers that detect undesired content, an
instance of using AI systems to assist with human supervision of
these systems. We have also released both a technical paper
describing our methodology and the dataset used for evaluation.


When given a text input, the Moderation endpoint assesses
whether the content is sexual, hateful, violent, or promotes
self-harm, all of which are prohibited by our content policy. The
endpoint has been trained to be quick and accurate, and to perform
robustly across a range of applications. Importantly, this reduces
the chances of products “saying” the wrong thing, even when
deployed to users at scale. As a consequence, AI can unlock
benefits in sensitive settings, like education, where it could not
otherwise be used with confidence.
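
As a rough illustration of how an application might act on these
category judgments, the sketch below reads a moderation result and
lists the categories that were triggered. It assumes a response
shaped like the one described in the API documentation (a "results"
list whose entries carry a "flagged" boolean and a "categories"
map). The sample payload, its scores, and the helper names
flagged_categories and is_allowed are hypothetical and exist only
for this example.

```python
from typing import Any, Dict, List

# Hypothetical example payload: the field names follow the response shape
# described in the Moderation documentation (an assumption to verify there),
# and every value below is invented for illustration.
SAMPLE_RESPONSE: Dict[str, Any] = {
    "results": [
        {
            "flagged": True,
            "categories": {
                "sexual": False,
                "hate": False,
                "violence": True,
                "self-harm": False,
            },
            "category_scores": {
                "sexual": 0.01,
                "hate": 0.02,
                "violence": 0.91,
                "self-harm": 0.03,
            },
        }
    ]
}


def flagged_categories(response: Dict[str, Any]) -> List[str]:
    """Return the sorted names of categories marked True in the first result."""
    results = response.get("results") or []
    if not results:
        return []
    categories = results[0].get("categories", {})
    return sorted(name for name, hit in categories.items() if hit)


def is_allowed(response: Dict[str, Any]) -> bool:
    """Allow content only when a result exists and it is not flagged."""
    results = response.get("results") or []
    # A missing "flagged" field is treated as flagged, so we fail closed.
    return bool(results) and not results[0].get("flagged", True)


if __name__ == "__main__":
    print(flagged_categories(SAMPLE_RESPONSE))
    print(is_allowed(SAMPLE_RESPONSE))
```

Failing closed on an empty or malformed response is a design choice
made for this sketch. An application in a less sensitive setting
might prefer to log the anomaly and let the content through.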


The Moderation endpoint helps developers to benefit from our
infrastructure investments. Rather than build and maintain their
own classifiers (an extensive process, as we document in our
paper), they can instead access accurate classifiers through a
single API call.
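
A minimal sketch of what that single call could look like, using
only the Python standard library. The URL, the "input" field, and
the bearer-token header reflect our understanding of the public API
reference and should be checked against the current documentation.
The function names build_moderation_request and moderate are
hypothetical helpers, not part of any official client.

```python
import json
import urllib.request
from typing import Any, Dict

# Assumed endpoint URL; confirm against the current API reference.
MODERATION_URL = "https://api.openai.com/v1/moderations"


def build_moderation_request(text: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for moderation of `text`."""
    body = json.dumps({"input": text}).encode("utf-8")
    return urllib.request.Request(
        MODERATION_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def moderate(text: str, api_key: str) -> Dict[str, Any]:
    """Send the request and return the parsed JSON response."""
    request = build_moderation_request(text, api_key)
    # urlopen raises urllib.error.HTTPError on non-2xx responses.
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling moderate("some user text", api_key) with a valid key would
return the parsed JSON response. Separating request construction
from sending keeps the network step easy to stub out in tests.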


As part of OpenAI’s commitment to making the AI ecosystem safer,
we are providing this endpoint to allow free moderation of all
OpenAI API-generated content. For instance, Inworld, an OpenAI
API customer, uses the Moderation endpoint to help their AI-based
virtual characters remain appropriate for their audiences. By
leveraging OpenAI’s technology, Inworld can focus on their core
product: creating memorable characters. We currently do not
support monitoring of third-party traffic.


Get started with the Moderation endpoint by checking out the
documentation. More details of the training process and model
performance are available in our paper. We have also released
an evaluation dataset, featuring Common Crawl data labeled within
these categories, which we hope will spur further research in
this area.